-
Notifications
You must be signed in to change notification settings - Fork 14
adjustments for building DB and using newer version of irsx #24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
| @@ -0,0 +1,31 @@ | |||
| export IRSX_CACHE_DIRECTORY=. | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a Makefile I made for building the whole db.
| perl -i.bak -p -e 's/SILVERCREST ASSET ,AMAGEMENT/SILVERCREST ASSET MANAGEMENT/g' $@ | ||
| python manage.py enter_yearly_submissions 2014 | ||
|
|
||
| download_filings_% : |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's a lot faster to download the files using this the aws-cli and then run 'load_filings' then have load filings be responsible also for downloading.
On the downside, this requires folks to have an AWS account.
| # }, | ||
| # }, | ||
| # ] | ||
| TEMPLATES = [ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This needed to be uncommented in order for any of the management commands to run.
| # row['canonical_version'] = CANONICAL_VERSION | ||
| #else: | ||
| # row['is_canonical'] = False | ||
| row['versions'] = row['versions'].split(';') |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is for using the github version of irsx.
| @@ -1,4 +1,5 @@ | |||
| from django.db import models | |||
| from django.contrib.postgres.fields import ArrayField | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i imagine you may not want to tie yourself to postgres. there is a portable version of jsonfield that could be used here.
| try: | ||
| FILE_SYSTEM_BASE = settings.FILE_SYSTEM_BASE | ||
| except ImportError: | ||
| except AttributeError: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is the correct exception catcher.
| partname = part | ||
| partdata = sked['schedule_parts'][part] | ||
| #print("part %s %s" % (partname, partdata)) | ||
| #self.stdout.write("part %s %s" % (partname, partdata)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
just using the django command guidance here.
| self.stdout.write("run_filing %s" % object_id) | ||
|
|
||
| # if we get a bad xml file, delete the file and retry once | ||
| try: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this was useful for me.
|
Thanks so much for sharing this @forest Gregg <[email protected]>, excited
to take a look. in the next few days. You're reading is correct; at the
outset I didn't want to require an amazon account, but the download with
aws is so much better with one it probably makes sense to use it (and
everyone working with this data at scale probably has one anyways).
…On Wed, Jul 17, 2019 at 3:02 PM Forest Gregg ***@***.***> wrote:
Hi @jsfenfen <https://github.com/jsfenfen>, this PR has a number of
different pieces that you may or not be interested in. I can split things
apart to just get you what you do care about.
------------------------------
You can view, comment on, or merge this pull request online at:
#24
Commit Summary
- adjustments for building DB and using newer version of irsx
File Changes
- *M* README.md
<https://github.com/jsfenfen/990-xml-database/pull/24/files#diff-0>
(2)
- *A* irsdb/Makefile
<https://github.com/jsfenfen/990-xml-database/pull/24/files#diff-1>
(31)
- *M* irsdb/irsdb/settings.py
<https://github.com/jsfenfen/990-xml-database/pull/24/files#diff-2>
(30)
- *M* irsdb/metadata/management/commands/load_metadata.py
<https://github.com/jsfenfen/990-xml-database/pull/24/files#diff-3>
(6)
- *M* irsdb/metadata/models.py
<https://github.com/jsfenfen/990-xml-database/pull/24/files#diff-4>
(16)
- *M* irsdb/metadata/views.py
<https://github.com/jsfenfen/990-xml-database/pull/24/files#diff-5>
(4)
- *M* irsdb/return/management/commands/load_filings.py
<https://github.com/jsfenfen/990-xml-database/pull/24/files#diff-6>
(38)
Patch Links:
- https://github.com/jsfenfen/990-xml-database/pull/24.patch
- https://github.com/jsfenfen/990-xml-database/pull/24.diff
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#24?email_source=notifications&email_token=AAE4AJFEEWAZBBZE6NS7F63P76JFVA5CNFSM4IEU535KYY3PNVWWK3TUL52HS4DFUVEXG43VMWVGG33NNVSW45C7NFSM4G73QODA>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAE4AJELCDXJB5JFF7BU4PTP76JFVANCNFSM4IEU535A>
.
|
|
Would recommend splitting off the change from |
Hi @jsfenfen, this PR has a number of different pieces that you may or not be interested in. I can split things apart to just get you what you do care about.